AutoScaler: Scale-Attention Networks for Visual Correspondence

Authors

  • Shenlong Wang
  • Linjie Luo
  • Ning Zhang
  • Jia Li
Abstract

Finding visual correspondence between local features is key to many computer vision problems. While defining features with larger contextual scales usually implies greater discriminativeness, it can also reduce the spatial accuracy of the features. We propose AutoScaler, a scale-attention network that explicitly optimizes this trade-off in visual correspondence tasks. Our architecture consists of a weight-sharing feature network that computes multi-scale feature maps and an attention network that combines them optimally in scale space. This allows our network to adapt the size of its equivalent receptive field over different scales of the input. The entire network can be trained end-to-end in a Siamese framework for visual correspondence tasks. Using the latest off-the-shelf architecture for the feature network, our method achieves competitive results compared to state-of-the-art methods on challenging optical flow and semantic matching benchmarks, including Sintel, KITTI and CUB-2011. We also show that our attention network alone can be applied to existing hand-crafted feature descriptors (e.g., DAISY) and improve their performance on visual correspondence tasks. Finally, we illustrate how the scale-attention maps generated by the attention network are visually interpretable.
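The scale-space combination described in the abstract can be pictured with a small sketch. The abstract does not state how the attention network's output is applied, so the code below assumes one common choice: a per-pixel softmax over scales, followed by a weighted sum of the per-scale feature maps. The function names (`softmax`, `combine_scales`), the array shapes, and the random inputs are all hypothetical and stand in for the outputs of the feature and attention networks; this is an illustration, not the authors' implementation.

```python
import numpy as np


def softmax(x, axis):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def combine_scales(features, logits):
    """Fuse per-scale feature maps with per-pixel attention weights.

    features: array of shape (S, H, W, C), one feature map per scale,
              assumed to be already resampled to a common resolution.
    logits:   array of shape (S, H, W), unnormalized attention scores
              (assumed here to come from a separate attention network).
    Returns the fused map of shape (H, W, C) and the weights (S, H, W).
    """
    # Assumption: weights are a softmax over the scale axis at each pixel.
    weights = softmax(logits, axis=0)
    # Weighted sum over scales; broadcast the weights across channels.
    fused = (weights[..., None] * features).sum(axis=0)
    return fused, weights


# Toy inputs standing in for network outputs (hypothetical sizes).
rng = np.random.default_rng(0)
S, H, W, C = 3, 4, 5, 8
features = rng.normal(size=(S, H, W, C))
logits = rng.normal(size=(S, H, W))

fused, weights = combine_scales(features, logits)
```

Under this assumption, a pixel whose weights concentrate on a coarse scale effectively uses a larger receptive field, while a pixel that favors a fine scale keeps higher spatial accuracy. The `weights` array is also the kind of per-pixel map one would visualize as a scale-attention map.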

Similar resources

Frequency-specific electrophysiologic correlates of resting state fMRI networks

Resting state functional MRI (R-fMRI) studies have shown that slow (<0.1Hz), intrinsic fluctuations of the blood oxygen level dependent (BOLD) signal are temporally correlated within hierarchically organized functional systems known as resting state networks (RSNs) (Doucet et al., 2011). Most broadly, this hierarchy exhibits a dichotomy between two opposed systems (Fox et al., 2005). One system...

Improving Energy-Efficient Target Coverage in Visual Sensor Networks

Target coverage is one of the important problems in visual sensor networks. The coverage should be accompanied by an efficient use of energy in order to increase the network lifetime. In this paper, we address the maximum lifetime for visual sensor networks (MLV) problem by maximizing the network lifetime while covering all the targets. For this purpose, we develop a simulated annealing (SA) ...

Visual Attention and Novelty Detection: Experiments with Automatic Scale Selection

We present experiments with an autonomous inspection robot, whose task was to highlight novel features in its environment using camera images. Experiments were conducted with two different attention mechanisms (saliency map and multiscale Harris detector) and two different novelty detection mechanisms (the Grow-When-Required neural network and incremental PCA). For both mechanisms we compared ...

Action Classification and Highlighting in Videos

Inspired by recent advances in neural machine translation that jointly align and translate using encoder-decoder networks equipped with attention, we propose an attention-based LSTM model for human activity recognition. Our model jointly learns to classify actions and highlight frames associated with the action, by attending to salient visual information through a jointly learned soft-attention...

An Approach for Target Detection and Extraction Based on Biological Vision

Inspired by the multi-scale image fusion mechanism of the insect compound eye, this paper proposes a target detection and extraction method based on the insect compound eye and the human visual attention mechanism. The main feature of this method is that a multi-scale visual attention mechanism is designed to improve the detection accuracy for targets of interest; meanwhile, the image is pre-segmented based on i...

Journal:
  • CoRR

Volume: abs/1611.05837

Publication year: 2016